A general approach for multi-oriented text line extraction of handwritten documents
Similar resources
Text Block Recognition in Multi-Oriented Handwritten Documents
Automatic detection of text blocks is an important step before applying OCR or word-spotting techniques to document images. Our approach focusses on handwritten (historical) documents and uses the Gabor Transformation to facilitate this task. Apart from the main text, which often consists of rectangular shaped text blocks, marginalia are of special interest here. These areas are generally uncon...
Text line detection in handwritten documents
Article history: Received 13 April 2007; received in revised form 26 March 2008.
A New Algorithm for Detecting Text Line in Handwritten Documents
Curvilinear text line detection and segmentation in handwritten documents is a significant challenge for handwriting recognition. Given no prior knowledge of script, we model text line detection as an image segmentation problem by enhancing text line structure using a Gaussian window, and adopting the level set method to evolve text line boundaries. Experiments show that the proposed method ach...
A Hybrid Approach for Line Segmentation in Handwritten Documents
This paper presents an approach for text line segmentation which combines connected component based and projection based information to take advantage of aspects of both methods. The proposed system finds baselines of each connected component. Lines are detected by grouping baselines of connected components belonging to each line by projection information. Components are assigned to lines accor...
Text line and word segmentation of handwritten documents
In this paper, we present a segmentation methodology of handwritten documents in their distinct entities, namely, text lines and words. Text line segmentation is achieved by applying Hough transform on a subset of the document image connected components. A post-processing step includes the correction of possible false alarms, the detection of text lines that Hough transform failed to create and...
Journal
Journal title: International Journal on Document Analysis and Recognition (IJDAR)
Year: 2011
ISSN: 1433-2833,1433-2825
DOI: 10.1007/s10032-011-0172-6